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Abstract We present detailed clustering analysis of a large K-band selected local galaxy 
sample, which is constructed from the 2MASS and the SDSS and consists of 82, 486 galaxies 
with 10 < K < 13.5 and 0.01 < z < 0.1. The two-point correlation function of the 
magnitude-limited sample in real space at small scales is well described by a power law 
^(r) = (r/6.44 ± 0.23)~^'^^*° '^^. The pairwise velocity dispersion is derived from the 
anisotropic two-point correlation function and find the dispersion (T12 = 685 ± 17 kms^^ if 
its scale invariance is assumed, which is larger than values measured in optical bands selected 
galaxy samples. We further investigate the dependence of the two-point correlation function 
and the au on the g — r color and the iiT-band luminosity, obtain similar results to previous 
works in optical bands. Comparing a mock galaxy sample with our real data indicates that 
the semi-analytical model can not mimic the (T12 in observation albeit it can approximate the 
two-point correlation function within measurement uncertainties. 

Key words: galaxies: statistics — infrared: galaxies — cosmology: large-scale structure of 
universe 



1 INTRODUCTION 

The track of galaxy formation and evolution keeps still one of the pivotal but intricate problems in modern 
astrophysical research. While there are large portion of prominent results mainly derived in optical bands, it 
is worth of addressing that observations of galaxies at distinct wave bands actually depict galaxies' intrinsic 
properties in different aspects, which is essential to our attempt to an unbiased understanding of the for- 
mation and evolution of galaxies. The Near-Infrared (NIR) observation is of particular interests. A K-band 
(2.15/im) selected galaxy sample has several advantages in studying galaxy. The K-correction in the K- 
band is a relatively small and better understood quantity comparing with the optical K-correction, galaxy's 
K-band light is 5 - 10 times less sen sitive to dus t and stellar populations, and moreover is independent of 
galaxy spectral types at z < 1 (e.g. iMobasher e t al., 1986; Cowie et al., 1994). Therefore K-band obser- 
vation enables a much homogeneous sampling of galaxy types. It is even advantageous that the K-band 
luminosity is tightly correlated with galaxy's stellar mass, so that the analysis at K-band is a better probe of 
galaxy properties and relevant evolution history related to the stellar component than those measurements 
in B- and r-bands. 

Studying clustering strength using the two-point correlation function (2PCF), of galaxies at different 
redshift bins is one of the most straightforward and effective methods of exploring how galaxy proper- 
ties are related to the underlying dark matter distribution and henceforth to the macro history of galaxy 
formation and evolution. For example, the clustering of galaxies selected at 3.6/im with median redshift 
z 0.5 offered by the Spitzer Wide-area Infrared Extragalactic (SWIRE) su rvey is directly compare d 
to the results of local galaxy samples selected at K-band centered at 2.15/im ("Waddi ngton et dl l2O07l) . 
Unfortunately in contrast with prolific photometric galaxy surveys available in NIR bands, spectroscopic 
information of these galaxies falls behind, it is not strange to see that majority of the clustering analysis 



2 



B. Ma et al. 



of NIR galaxies resorts to angula r correlation fun ctions uj{9) (e.g. iBaugh et al.l 1 19961: iMcCracken et al.L 
[2000; Roche et al., 2003; Waddin gton etaUl2007l) . the spatial correlation function ^ (r) is applicable only 
occasionally (ICarlberg et al.l,ll997l) . 

In principle one can invert the angular 2PCF t o the real space spatial correlation function through 
the Limber's equation (see p. 194 of iPeeblesi Il980l) . The problem is the precision of the inversion relies 
significantly on the radial selection function of galaxies and the assumptions of small angle approximation 
and a power law ^(r), which introduces apparent uncertainties to its interpretation (Bernardeau et al. . 20021) . 
e.g. setting up the empirica l evolution model of the real space two-point correlation function by ^(r, z) = 
(r/m)-'^ X (1 + zV-^-' (|Phimppsetal.[ll978h . 

Thanks to the Two-Micron All Sky Surve y Extended Sourc e Catalogue (2MASS XSC IJarrett et all 

|2000|) and the Sloan Digital Sky Survey (SDSS lYork et al.ll2000h . we are able to construct a large K-band 
selected local galaxy sample to measure spatial two point correlation function, and thus to built cluster- 
ing reference of local NIR galaxies. By combining two-point correlation functions in both redshift and 
real spaces, we are also able to derive quantities on the galaxies' peculiar velocities which are largely ig- 
nored in characterizing clustering of galaxies. Actually to appropriately descibe galaxy distribution, we 
should work in the phase space supported by both of the position and the peculiar velocity, especially in 
the nonlinear regime. Thus measurement of the pairwise peculiar velocity dispersion is robus t, providing an 
important statisti cs to i dentify galaxy popula tions and test galaxy formation models (e.g. iZhao et al.l 120021: 
lYoshikawa et~ail l2003l lling & Borne a i2004l:ISlosar et al.Ll2006HLi et al.Ll2007h . 

A brief summary of the sample used is presented in Section 2, as well as the estimation method of 
2PCF, including technical issues such as random sample construction, correction to the fibre collision. In 
Section 3 we present measurements of the full, flux-limited sample in both redshift- and real-spaces, in 
together with the redshift-space distortion parameter and the pairwise velocity dispersion. In Section 4 
the full sample is divided into different sub-samples according to their color and luminosity to probe the 
clustering as functions of these properties. 

Throughout this paper, galaxy distances are obtained from redshifts assuming a cosmology with ftm = 
0.3,51a = 0.7, quoted in units of h^^Mpc. Absolute magnitudes quoted for galaxies assume h = 1 to 
avoid the —5 logio factor. 



2 SAMPLES AND ESTIMATION PROCEDURE 
2.1 the K-band sample of local galaxies 

Our K-band galaxy sample is selected from the 2MASS XSC. R edshifts for thi s smap le are obtained from 
the SDSS. The two catalogues actually have been combined bv iBlanton et al.l (l2005l) into the New York 
University Value-Added Galaxy Catalog (NYU- VAGC)Q to form a local redshift sample(mostly below z ~ 
0.3)0, with a coverage of 9938 deg^ for photometric imaging and 6750 deg^ for spectroscopic observation. 

The c ompleteness of the cross-matched 2MASSH-SDSS Catalogue has been extensively discussed (e.g. 
iBell et all [2003.) . Following these practices, we select the galaxies by the extinction-corrected Kron mag- 
nitudes in the range 10 < K < 13.5, and redshifts in the range 0.01 < z < 0.1. In the apparent magnitude 
limits adopted here, the bright end is chosen to avoid incompleteness due to the large angular sizes of 
galaxies, while the faint end is such to match the magnitude limit of 2MASS. We also use a low redshift cut 
rejects galaxies with redshifts seriously affected by Hubble flow. The final flux-limited sample, our main 
sample, accumulates 78,339 galaxies with redshifts in total, with a median redshift ^ 0.06. 

The K-selected sample is further divided into two sub-samples according to their SDSS color (g — r): 
the blue sample with (g—r) < 0.7 and red sample with of (g—r) > 0.7 (Figure[T]andTable[T]i. Both red and 
blue samples have similar space densities, but galaxies in the red sample are systematically more luminous 
and therefore mainly inhabit at higher redshift: the median redshift of red galaxies is Zmed ^ 0.065 whereas 
for the blue z^ed ^ 0.054. 

A set of volume-limited sub-samples within different absolute magnitude bins are also constructed for 
analysis (Table|2]i. 

' |htt p : //sdss .physics ■ nyu ■ edu/vagc| 

2 The version of the NYU-VAGC used in this paper is the SDSS DR6 jAdelman-McCarthv et aill200a) . 
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Fig. 1 The K-corrected g - r color vs. K-band absolute magnitude for all galaxies. The vertical 
line demarcates a simple cut at g - r = 0.7 



Table 1 The K-selected samples including both red and blue populations. All samples are in 
the range of 10 < X < 13. 5 and 0.01 < z < 0.1. M* and a are fitted parameters of the 
Schechter luminosity function ( SchechteJ. 1 1 976h . rg and 7 are obtained from fit to the projected 
2PCF Wp{(j). The numbers of galaxies quoted here include those lack of redshift due to fibre 
collision. 



label 




M* 


a 


ro(/!."^Mpc) 


7 


P 


Main 


82,486 


-23.38 


-0.94 


6.44 ± 0.23 


1.81 ±0.02 


0.39 ±0.17 


Red 


46,689 


-23.31 


-0.48 


7.62 ± 0.32 


1.87 ±0.03 


0.35 ± 0.20 


Blue 


35,797 


-23.08 


-1.11 


4.92 ±0.14 


1.67 ±0.02 


0.50 ±0.15 



Table 2 Volume-limited sub-samples and their 2PCF parameters, n is the mean number density 
in unit of 10^'^/i'^Mpc^'^, and ro is in unit of /i^^Mpc. All galaxy samples are of 10 < A' < 
13.5, 0.01 < z < 0.1. ro and 7 are obtained from 'Wp{a), the numbers in parentheses are values 
of corresponding luminosity sub-samples of the mock galaxy catalogue. 



Abs. Mag. 


Redshif 


range 




n 


ra 


7 


-25- 
-24- 
-23- 
-22- 


-24 
-23 
-22 
-21 


0.033 - 
0.021 - 
0.013- 
0.010- 


0.100 
0.067 
0.042 
0.026 


14,152 
19,714 
8,969 
2,232 


0.87 
3.83 
7.08 
7.19 


7.17 ±0.30 (6.93 ±1.10) 
5.52 ±0.28 (4.97 ±0.23) 
5.23 ±0.65 (4.50 ±0.24) 
4.46 ± 0.57 (4.79 ± 0.29) 


1.87 ± 0.03 (1.82 ±0.07) 
1.83 ± 0.03 (1.87 ±0.03) 
1.83 ± 0.04 (1.86 ±0.04) 
1.86 ±0.10 (1.92 ±0.04) 



2.2 mock galaxy catalogue 

We also compare our sample with a mock galaxy catalogue to scout for the performance of semi-analytical 
models for hierarchical galaxy formation (see Baugh, 2006, and references there in). The mock catalogue 
is derived from a high-resolution pure dark matter cosmological simulation with 512'^ particles in a box of 
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size 100 /i ^Mpc run by the GADGET2 code (ISpringel[l2005l) . The fundamental cosmological parameters 
for the simulation are set as f^A = 0.762, flm = 0.238, h = 0. 73, gs = 0.74, n = 1, rest simulation 
parameters are the same as those in the pure dark matter run by iLin et al] (l2006l) . e.g. the particle mass 
is 5.5 X IO^H'^Mq and the soft length is 4.5ft.^^kpc. We generate the mock galaxies at z — 0.05 from 
the output of the simulation by the semi-analytical model of iKang et al.l ( l2005h . Luminosities are the only 
derived property for the mock galaxies, thus we divide the mock galaxies into a set of luminosities bins and 
study galaxy and dark halo properties in each bin. 



2.3 estimation of the two-point correlation functions 

Two types of 2PCFs for this sample are measured to capture the clustering patterns: one is the isotropic ^(s) 
with s denoting the separation of a pair of galaxies in redshift space, and the other is the two-dimensional 
function £,{a,Tr = \/ — a^) with vr indicating the separation of a pair of galaxies parallel to the line-of- 
sight and the a being the separation perpendicular to the line-of-sight. The latter is mainly utilised to obtain 



the redshift distortion free function, the projec ted 2PCF w„(g) ^ f 7r)d7r, and consequ ently the real 
space function ^(r) after proper inversion (e.g. lDavis & Peeblesl ll983VlHawkins et al.Ll2003l) . 



The 2PCFs are measured using the estimator of iLandv & Szalavl (Il993h 



DD- 2DR + RR 

^ ^ RR ' 

in which DD is the normalised number of weighted galaxy-galaxy pairs at given separation, RR is the 
normalised number of random-random pairs within the same separation in the random catalogue and DR is 
the normalised number of weighted galaxy -random pairs. In general the scale s of ^(s) is binned uniformly 
in logarithm scale, and for f (ct, tt), a and tt are binned in linear scale. 

To proceed the estimation with Eq. [T] an auxiliary sample of completely random points in the exactly 
the same geometric window as the galaxy sample is prerequisite. The random samples should have the same 
redshift, magnitude and mask constraints as the real data, with a smooth selection function matching the 
N{z) of the real data. The luminosity functions of the flux -hmited sa mples, used to generate the selection 
function, are computed with the STY method (Sandag e et al.l 11979.) in form of the Schechter function 
(Table [T]!. We generate a random sample ten times larger than the observed K-selected sample, and the 
random samples for red and blue samples are 15 times larger than observed ones. 

A weight is assigned to each galaxy and random gmnt a ccording to th eir redshift and angular position 
to minimize the variance in estimated ^ (iEfstathioui [19881 iHamiltonn 1 9931) . 



(2) 



l + 47rn(z)$, J3(s) ' 



where $i is the selection function at the location of ith galaxy, n{z) is the mean number density, and J3(s) = 
^(s)s^ds. The Jz{s) is computed using a power-law ^(s) with correlation length sg = 12 /i^^Mpc and 
7o — 1, we find that the difference between our es t imate and that using a raw measured ^(s) is negligible, 
in agreement with the conclusion o f lHawkins etan(l2003h . To normalize the pair counts properly, we assure 
that the sum of weights of the random catalogue equal the sum of weights of the real galaxy catalogue, both 
are a function of scale. 

We need to correct the incompleteness in the spectroscopic sample due to the effect of collided fiber 
constraints. The design of the SDSS instrument means that fibers can not be placed closer than 55 arcsec 
on the same tile, members of a close pair of galaxies cannot be targeted in a single fibre configuration 
so that the survey misses a large fraction of close pairs. Because 2PCF will be systematically underesti- 
mated without taking account of fiber col lisions effect, several meth ods are developed to correct this bias 
dZehavi et all 120021: iHawkins et all l2003h . We adopt the method of IZehavi et"an (12002') by assigning the 
redshift of the observed galaxy in a pair to the pair member whose redshift was absent. Then we obtain 
"collision corrected" redshift. At large scales, where both members of the pair contribute to the same sep- 
aration bin, this method is equivalent to double weighting. We argue this method s hould pe rform better on 
small scales because it retains information about the known angular positions. Ze havi et al.l ([2005) showed 
that this method is an adequate treatment: residual systematics for the redshift space correlation function 
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were considerably smaller than the statistical errors, and this was even more true for the projected 2PCF 

Wpia). 

The covariance matrix is computed with the jack-knife technique dLuptoni 1 1 9931; IZehavi et all l2002h . 
the galaxy sample is splitted into thirty separate regions of approximately equal sky area, and then we per- 
form the analysis thirty times, each time leaving a different region out. The estimated statistical covariance 
of 2PCF measured in two bins of i and j is then 

7V-1 ^ _ _ 

C0V(C», Cj) = - mj.i - Cj) , (3) 

i=l 

in which N is the number of jack-knife sub-samples. 



3 CLUSTERING OF THE MAIN SAMPLE 

3.1 2PCF in redshift space 

The first one we calculated is the 2PCF ^(s) for the main sample from s = 0.01 /i^^Mpc to s = 
100 h~^Mpc in redshift space, s is divided into equally logarithmic bins of width A \ogiQ{s/ h~^Mpc) = 
0.125. ^(s) is not a single power law at all scales (Figure|2|i, rather 

^ ^ (11.69 ± 0.89, 0.98 ± 0.02), z/sG (0.1,3); 

^^^^~^so' ' (7.64 ±0.41, 1.60 ±0.05), z/se (3,10). ^ ' 

It is known that ^(s) also consists of the contribution from the galaxy peculiar velocities causing the 
redshift distortion, in addition to the true spatial correlation of galaxies (Kaiser, 1987; Hamilton, 1998). 
Thus we need to break the degeneracy between the spatial clustering and the velocity correlation before 
making direct comparison of ^(s) in different redshift bins. 

3.2 The projected 2PCF 

Galaxy's peculiar motion only cause drifting of the radial position. To minimize this effect, we can calculate 
the correlation function as a function of cr and tt, where a is perpendicular to the line-of-sight and tt is 
parallel to the line-of-sight. Then the projection of ^(cr, vr) onto the a plane is independent to redshift space 
distortion and gives the information of real space correlation function. 

The effect of redshift distortion is clearly seen in the ^{a, tt) contour of the main sample in Figure|2] the 
contours are elongated along the line-of-sight direction at small a separation, exhibiting the phenomenon 
of fingers-of-God by the random pairwise ve locity; at large a scal es, the contours are squashed in the tt 
direction due to gravitational coherent inflow (iHawkins et al.Ll2003l) . 

Integrating the anisotropic ^(cr, tt) over tt gives the projected 2PCF, 

^(cr, TT)dTr = ^ ^(cr, TTi)ATT, , (5) 

which has practically an upper limit TT^ax — 50 h^^Mpc. We tested that there is little difference if using a 

larger cutoff. 

Wp is related to the 2PCF in real space through the Abel transform ( lDavis&Peebleslll983h 

where the inversion renders ^(r) assuming a step function-like Wp{a) — Wi at each bin centered at Ci 
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CT / h ' Mpc r / Mpc 

Fig. 2 2PCFs of the main sample. Top-left: ^ in redshift space, the dot-dashed line is the power 
law at scales 0.1 < s < 3 h^^Mpc and the dashed line is the fit at scales 3 < s < 10 h^^Mpc 
(Eq. Top-right: ^((T, tt) contours, a and tt are separations perpendicular and parallel to the 
line of sight respectively, solid contours are measurements at levels of 4, 2, 1, 0.5, 0.25, 0.125 
outwards and dotted lines are the isotropic 2PCF expected in the absence of redshift-space dis- 
tortion. Bottom-left: projected 2PCF Wp(cr)(solid points), the solid line is the power law fit. 
Bottom-right: 2PCF ^(r) in real space inverted from Wp through Eq. |7] solid line is the power 
law transformed from the fit to the Wp on the left. Error bars in these plots are the la scatters of 
jackknife sub-samples. 



for r = at dSaunders et aU[r992l) . Mathematically the inversion is not stable but in practice works well in 
r < 30/i~^Mpc. If we simply assume ^(r) ~ {r/ro}^'' , the integral in Eq. |6]can be done analytically, 
yielding 

Wp = Aa'-\ wzth A = r^^T{l)T{2-l)/r{l), (8) 
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Table 3 C lustering of galaxies selected at different bands, r-b and (SPSS) r esult comes from 
IZehavi et al. (2002, 2005) and the 6.7-band (2dFGRS) is given bv lHawkins eTa l. ( 2003) . 



band 




ro (/i^'Mpc) 




(T12 ( kms ^) 


Ks 


82,486 


6.44 ± 0.23 


1.81 ±0.02 


685 ± 17 


r 


154,014 


5.59 ±0.11 


1.84 ±0.01 


640 ± 60 


bj 


165,659 


4.95 ± 0.27 


1.72 ±0.04 


506 ± 52 



where T is the Gamma function. It is true that Wp for our main sample is a power law function at small 
scales (Figure |2]i, the best-fit parameters are 7 = 1.81 ± 0.02 and tq — 6.44 ± 0.23 in the regime of 
0.1 h-'^Mpc < (T < 18 h-^Mpc. 

Table[3]lists the results of our K-selected sample and those selected in the other bands for comparison: 
bj of 2dFGRS and r of SDSS. We conclude that the correlation function will have larger amplitude and 
steeper slope if the galaxies are selected at longer wavelength band. 

3.3 the pairwise velocity dispersion 

Currently there is no precise theory on the full scale range redshift distortion ('Scoccimarro', '2004V But 
we can still approach to this topic with reasonable assumptions. Intuitively redshift distortion can be ap- 
proximated by certain convolution of two components dominated in different regimes, coherent infall is 
responsible for the clustering enhancement at large scales while the smearing of correlation strength at 
sm all scale s is attributed to random motions. 

lKaiseil(tl987b found that at large scales the boost to the power spectrum by the pecul iar velocities take s 
a particularly simple form, which was later completed and translated to the real space bv lHamiltonI ( Il992l) . 

^'{a, tt) = 6(s)Po(/") ± Us)P2{f-i) + Us)P4{f^) , (9) 

where Pe{n) is Legendre polynomials, /i = cos{9) with 9 being the angle between r and tt. Assuming 
^ — (r/ro)"''' then renders 

(«{') = ({') = (1 + Y + x) «'■) 



3 7 y V7 

8/32 f 7(2 + 7) ^ 

where s and r have the correspondent values, f3 is the linear redshift distortion parameter, f3 « and 
b is the linear bias parameter. The first equation is independent on the form of ^(r). 

To incorporate effects of random motion, the anisotropic 2PCF in redshift space is then approximated 
by the convo lution of the ^'(tr, tt) by Eq. |9]with the distribution function of the pairwise velocity /(W12) 
(c.f. ,Peeblel[l993h . 

/+00 „, 
^'{a,7r-^)f{v,,)dvi2, (11) 
-00 ^0 

and in general /{vu) is assumed to be an exponential distribution of dispersion (J12 

ft ^ 1 ( ^^^l2 ^ 

= TS'^^P • (12) 



V2 \ <J12 



After measuring f (s) and ^(r) at large scales, we can obtain the redshift distortion parameter j3 via the 
first equation in Eq.[TO]easily. Then we combine Eq.|9]- Eq.[T2]to fit the ^(<t, vr) data grid to determine other 
model parameters. However there are implicit assumptions in the prescription: (1) the linear bias parameter 
b is forced to be scale-independent; (2) the pairwise velocity dispersion 0-12 is presumed invariant to the 
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Scale /h 'Mpc o/h 'Mpc 

Fig. 3 Left: ratios of ^is)/£,{r) of the full sample (solid circles) in comparison with that of the 
sample without the Sloan Great Wall (open triangle), the dotted line corresponds to /3 = 0.39. 
Right: the derived pairwise velocity dispersion au (cr). Error bars are obtained by jackknife re- 
sampUng. 



separation tt along the line-of-sight but could be a function of the perpendicular separation a. Therefore the 
resulting cri2 in the model is not the actual true pairwise velocity dispersion. To have meaningful comparison 
with simulations, we need to estimate the (T12 from the 2PCFs of the simulation data in the same way as of 
the galaxy sample. 

Figure |3] shows the ratio of the redshift-space 2PCF to the real-space 2PCF, and the derived (T12 as 
function of a. ^{s)/£,{r) increases with scales, and becomes roughly constant in the range 4/i~^Mpc < 
r < 13 h~^Mpc as expected, however, the ratio increases again at larger scales. If we exclude the galaxies 
in the Sloan Great Wall region (ll'' < R.A. < 14''), the £,{s)/£,{r) ratio indeed does not show the up- 
shooting anymore. Nevertheless, there are no appealing arguments about whether chopping off galaxies in 
the Sloan Great Wall guarantees fairness, we just pick up points in the scale range (4, 13 /i^^Mpc) to fit to 
a constant and get £(s)/$( r) = 1.29 ±0.1 4, or /3 = 0.39 ± 0.17, considering the fact that the final ai2 is 
not sensitive to the /? at all jLi et aUl2"006al) . 

The (T12 as a function of projected separation a shown in Fig. [3] has a classical shape as many other 
measurements (e.g. Jing & Borner, 2004; Li et al., 2007): <ti2 rises from 650 kms^^ up to 750 kms^^ as 
(T increases from 0.1 h^^Mpc to 0.3/i^^Mpc, forms a plateau till to ct ^ 1 h^^Mpc, then drops down 
again at larger scales. If a scale independent (T12 is assumed, the best fitted ai2 = 685 ± 17 km s^^ within 
0.1 /i~^Mpc < r < 18 /i~^Mpc. Our (T12 are significantly higher than that of 6j-band sam ple (2dFGRS, 
iHawkins etaill2003h and slightly larger than the r-band sample (SDSS. IZehavi et al.ll2002l) . 

4 COLOR SUB-SAMPLES 

Figure |4] displays the clustering dependence on the color of galaxies, from ^(s) and Wp as numerous works 
already revealed that spatial clustering for red galaxies is much stronger than for blue galaxies (see also 
Table [T]). The slope of the 2PCF is a direct indicator of strength of the galaxy dynamic non-linearity, thus 
red galaxies with steeper 2PCF are more harassed by the gravitational turbulence at small scales or local 
structures than blue galaxies. In the scheme of halo models, the slope of 2PCF is determined by the percent- 
age of contribu tion from the one - halo t erm and two-halo terms, rather than the mass of the halo in which the 
galaxy resides ( Coorav & Shethl [20021) . A steeper 2PCF at small scales contains more power from the one- 
halo term. If galaxies are simply centrals and satellites in dark halo, it leads to a conclusion that the satellite 
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Fig. 4 g — r color dependence of 2PCFs and cri2. All error bars are derived from jackknife 
sub- samples. 



fraction in the red sub-sample is higher than in the blue sub-sample, although red galaxies incline to occupy 
more massive halos than blue galaxies. In fact the interpretation is confirmed by the velocity information: 
(Ti2((t) of blue galaxies is very flat and has much lower amplitude than red galaxies over wide range of 
projected separation, which is exactly wh at is observed in simulations when reducing satellite fraction in 
mock galaxy samples (ISlosar et al.ll2006l) . We also note that the color dependenc e of a^2 is very similar to 
the simulation results of the old and young populations galaxies demonstrated by [Weinberg et al.l (|2004|) . 

5 VOLUME-LIMITED SUB-SAMPLES AND COMPARISON WITH MOCK CATALOGUE 

We construct four volume-limited sub-samples to investigate the luminosity dependence of clustering 
(Table|2|i. The main sample is divided to 4 absolute magnitude bins centered approximate ly from M* + 2 t o 
M* — 1, where M* = —23.38 is the characteristic luminosity of the Schechter function dSchechteii 1 1 976h . 
The number density for galaxies in the lowest luminosity bin is 8 times higher than that in the highest 
luminosity bin. 
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Figure |5] shows the projected correlation functions Wp(o') of volume-limited samples, and Table|2]lists 
the parameters vq and 7 for power law £_{r) models by fitting Wp{rp) in the range 0.1 /i~^Mpc < a < 
13 /i~^Mpc. The slopes of these sub-samples are roughly constant with errorbars, indicating weak depen- 
dence on lumin osity; but the correlation length rp does increase w ith luminosity, thus being consistent with 
earUer studies dNorberg et all I2OOII: IZehavi etall 120051: iLi et all |2006b), and the tq - L relation follows 
the same law as the relations in other bands and simulations (Figure|5]l- 

We also explore the relative bias factor b/b* computed using the ratio of the Wp{2.7 h^^Mpc) of our 
four sub-samples to the Wp{2.7 h~^Mpc) of the -24 < Mk < -23 sub-sample (L w L*). This fidu- 
cial separation of 2.7 /i~^Mpc is chosen because it is well out of the extremely non-linear regime, but still 
small enough so that the correlation functions are very accurately measured for a ll sub-samples . In th e 
bottom right panel of Fig ure [51 we compa red our measured b/b* to the models of iNorberg et al.l (1200 lb . 
iTegmark et alj (|2004| ) and Li et al. ('2006b') togethe r with data fr om simulations. Within the estimated un- 
certainties, the models of Norberg et al. (200l5 and lTegmark et al. (2004) are basically consistent with our 
estimations, but the model of Li et al.l (I2006bl) and the semi-analytical model fail to produce enough clus- 
tering power in the highest luminosity bin. 

Th e lu minosity depend ence of ai2 is rather much complicated as already reahzed by Ijing & Bornej 
(|2004') and lLiet al.l ('2006a'). It is pointed out that, at small separation, 0-12 has a trough at L*, becomes 
relative flat at lower luminosity end and increasing rapidly at L > L*. This has not been reproduced by any 
current halo models. Figure|6]shows that the ai2 curve of L* sub-sample at cr < 1 h^^Mpc is fairly flat and 
has smaller amplitude than other luminosity sub-samples, implying a relative smaller fraction of satellites 
within the luminosity bin or the stellar mass range. Interestingly that (J12 curves of the brightest sub-sample 
and the faintest sub-sample have very similar shape although amplitudes differ slightly, it seems that the two 
sub-samples contain similar fraction of satellites although their occupied halos have very different masses, 
since the brightest sub-sample has a much bigger bias than the faintest sub-sample. 

The semi-analytical model works well for galaxies with low K-band luminosity, but displays significant 
discrepancies at high luminosity end de spite the rough agreement in spatial clustering (Table |2]l. The case 
in K-band is different to the discovery of iLi et al] (|2007|) at r-band where there are less differences between 
the semi-analytical model and the data at lower luminosity end. However the inconsistency of the semi- 
analytical model may be cuased by the limited size of the simulation, henceforth deficiency in large mass 
halos to provide sufficient number of galaxies of high stellar masses. 

6 SUMMARY AND DISCUSSION 

For furnishing the clustering evolution of NIR galaxies, a local galaxy catalogue limited by K-band magni- 
tudes of 10 < X < 13.5 is generated by cross-matching the 2MASS data with the SDSS survey, then we 
carefully measured the 2PCFs and the pairwise velocity dispersion of the galaxy sample, as well as their 
color and luminosity dependence respectively. 

In the redshift space, the 2PCF ^(s) of the flux-limited sample complies with shallower power law 
at scales of s <^ 3 h^^Mpc than that at larger scales, but in real space the 2PCF ^(r) derived from the 
projected 2PCF w{a) is perfectly approximated by a single power law with ro — 6.44 ± 0.23 /i^^Mpc 
and 7 — 1.8 1 zt 0.02 for r < 1 ~K)/i~^Mpc (Figure |2]i. The estimated correlation length supports the 
conclusion of lWaddington et aL I (l2007h that the clustering of NIR galaxies evolves very slow upto z ^ 0.5. 
The pairwise veloc ity dispersion at small scales is directly related to the spatial distribution in the halo 
(ISlosar et al l l2006h . our (712 of local NIR galaxies has shape analogous to those found in simulations, 
showing a bump at scales around 1 h^^Mpc, but is larger than in optical bands, being 685 ± 17 kms^^ if 
assuming scale invariant ai2 (Table |3]l. The phenomenon could be rooted in different luminosity functions 
at these bands, it is likely that the distribution of A'-band luminosity or stellar mass of satellite galaxies 
is more concentrated in the range defining our sample than that of central galaxies. Regrettably we do not 
have a large NIR galaxy sample at high redshift with redshift, otherwise comparison of (T12 would enable 
us peering into the evolution of the positioning of NIR galaxies in their host halos, or the velocity biasing 
of galaxies relative to the dark matter. 

The optical color dependence of the clustering of NIR galaxies is similar to optical bands selected 
galaxies, blue galaxies of g — r < 0.7 are much less clustered than red galaxies, and displaying a very flat 
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and low ai2 in nonlinear regime as consequences of having smaller fraction of satellite galaxies. And the 
luminosity dependence of 2PCF and ai2 of our NIR galaxies is not different with that of optical galaxies 
in spite of the very different luminosity functions. As the i^T-band luminosity is tightly correlated with the 
stellar mass, it i s someh ow surprising to discover that the empirical formula of relative biasing by stellar 
mass in lLi et al\ (l2006bh under-predict the bias of the brightest volume-limited sample. Also, the transition 
of the shape and amplitude of <ti2 at L* might infer that the luminosity distribution of satellite galaxies in 
low mass halos are very different to that in high mass halos. 

Examination of our galaxy sample against mock galaxy sample reveals that the i^T-band luminosity de- 
pendence of the 2PCF can be approximately reproduced by semi-analytical modelling within measurement 
errors, but the 0-12 of the mock deviates from observation significantly in aspects of amplitude, shape and 
luminosity dependence, especially for bright galaxies. The peculiar velocity of galaxy is a conundrum in 
galaxy formation models both in theories and simulations, accurate modelling requires exquisite fabrication 
of galaxies in halos and corresponding evolution paths, it seems so far there is still a long way to go. 
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Fig. 5 Top: projected correlation functions Wp{<T) for the volume-limited sub-samples. Bottom: 
luminosity dependence of the comoving correlation lengths and the linear bias parameters of 
different galaxy samples and the mock galaxy sample. The relative bias factors b/b* is defined 
by the amplitude of Wp(cr) measured at a = divided by the same quantity of the 

-24 < Mk < -23 sub-sample (L sa L*), the dotted curve is at the 6j-band of the 2dFGRS 
survey b/b* = 0.85 + 0.15L/L* (Norberg et al., 2001), the soHd curve is from the SPS S r-band 
power spectrum b/b* = 0.85 + 0.15L/L* - 0.04(M - M*) (iTegmark et all |2004 . and the 
dashed c urve is a fit to the Wp{a) of stellar mass sub-samples b/b* = 0.896 + 0.097M/M* 
dLi etalir2006b) in which M and M* refer to stellar mass. 
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Fig. 6 (712 of volume limited sub-samples (circles) against semi-analytical modelling (squares). 



